Investigating the Security Threat Arising from “Yes-No” Implicit Bias in Large Language Models

Published in AAAI 2025, 2024

Recommended citation: Sendong Zhao, Du et al. (2025). "Investigating the Security Threat Arising from “Yes-No” Implicit Bias in Large Language Models; AAAI 2025.
Download Paper